Does Human Action Recognition Benefit from Pose Estimation?

نویسندگان

  • Angela Yao
  • Juergen Gall
  • Gabriele Fanelli
  • Luc Van Gool
چکیده

Introduction The earliest works in action recognition focused on tracking body parts and classifying the joint movements. These pose-based approaches, while straight-forward, require accurate tracking of body parts, which is a challenging task in its own right. As recent trends in action recognition have shifted towards natural and unconstrained videos (e.g. films, broadcast sports, Youtube videos), efforts have shifted from high-level modelling of the human body to directly classifying actions with abstract and low-level appearance features in appearance-based approaches. But despite requiring more initial processing, pose representations have several advantages. First, they have fewer intra-class variances; in particular, 3D skeleton poses are viewpoint and appearance invariant, such that actions vary less from actor to actor. Secondly, using pose representations simplifies learning for action recognition, since relevant highlevel information has already been extracted. Given the great progress in pose estimation over the past few years [1], we feel that pose-based action recognition systems warrant a second look. In this work, we compare pose-based and appearance-based features for action recognition as depicted in Fig. 1. Our pose-based features are derived from articulated 3D joint information; we label as appearancebased any feature which can be extracted from video data without explicit articulated modelling of the human body. For fair comparison, we apply the same action recognition system [4] to the two different sets of features. Finally, we combine the two feature types into a single system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Human Action Recognition with an Incomplete Real - Time Pose Skeleton

Currently, most human action recognition systems are trained with feature sets that have no missing data. Unfortunately, the use of human pose estimation models to provide more descriptive features also entails an increased sensitivity to occlusions, meaning that incomplete feature information will be unavoidable for realistic scenarios. To address the problem of occlusions, this paper proposes...

متن کامل

Classifying Human Actions Using an Incomplete Real-Time Pose Skeleton

Currently, most human action recognition systems are trained with feature sets that have no missing data. Unfortunately, the use of human pose estimation models to provide more descriptive features also entails an increased sensitivity to occlusions, meaning that incomplete feature information will be unavoidable for realistic scenarios. To address this, our approach is to shift the responsibil...

متن کامل

2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

Action recognition and human pose estimation are closely related but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for jointly 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and stil...

متن کامل

Human 3D Pose Estimation and Activity Recognition from Multi-View Videos: Comparative Explorations of Recent Developments

This paper presents a review and comparative study of recent multi-view approaches for human 3D pose estimation and activity recognition. We discuss the application domain of human pose estimation and activity recognition and the associated requirements, covering: advanced Human-Computer Interaction (HCI), assisted living, gesture-based interactive games, intelligent driver assistance systems, ...

متن کامل

Face Pose Estimation from Eyes and Mouth

Face pose estimation plays an important role in human computer interaction, automatic human behavior analysis, gaze estimation, virtual reality, pose independent face recognition etc. Accuracy and speed are the most desirable features of a face pose estimation system. In this paper, a face pose estimation scheme based on the centers of the eyes and mouth is proposed. The proposed method is simp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011